Combining Features at Search Time: PRISMA at Video Copy Detection Task

نویسندگان

  • Juan Manuel Barrios
  • Benjamin Bustos
  • Xavier Anguera
چکیده

Most of current Video Copy Detection systems (VCD) perform a multimodal detection by dividing the system into subsystems. Each subsystem performs a copy detection using a different feature (either visual or audio), and the sets of candidates are combined (fused) to create the final result. We present a VCD system that fuses visual and audio descriptors at the similarity search level. The system produces the copy candidates by comparing video segments using visual and audio descriptors instead of fusing copy candidates from independent subsystems. We submitted four Runs to TRECVID 2011 CCD task: • PRISMA.m.balanced.EhdGry: a combination of two visual global descriptors. Two detection candidates per query. • PRISMA.m.balanced.EhdRgbAud: a combination of two visual global descriptors and one audio descriptor. Two detection candidates per query. • PRISMA.m.nofa.EhdGry: a combination of two visual global descriptors. One detection candidate per query. • PRISMA.m.nofa.EhdRgbAud: a combination of two visual global descriptors and one audio descriptor. One detection candidate per query. Our Runs achieve good detection effectiveness, especially for NoFA profile, and they are among the fastest Runs. To the best of our knowledge, this is the first VCD system that successfully fuses audio and visual descriptors at an earlier stage than decision level. Additionally, we have performed a joint submission with Telefonica Research team, under the name Telefonica-research.m.balanced.joint, which tests the combination at the decision level of Telefonica’s local descriptor, audio descriptor, and PRISMA’s EhdRgb global descriptors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Telefonica Research at TRECVID 2011 Content - Based Copy Detection

This notebook paper summarizes the algorithms behind Telefonica Research participation in the NIST-TRECVID 2011 evaluation on the Video Copy Detection task. This year we have focused on 1) Improving the image-based matching system to better process video files; 2) implemented and tested a novel audio local fingerprint; and 3) improved the multimodality fusion algorithm from last year. For this ...

متن کامل

Telefonica Research Content-Based Copy Detection TRECVID Submission

This notebook paper presents the systems presented by Telefonica Research within the MESH team for the task of Video copy detection in TRECVID 2009. We participated in the Video-only, Audio-only and Audio+Video tasks. Our main contribution is the combination (when possible) of audio and video features within the same system by using global features extracted both from the reference videos and t...

متن کامل

An Improved Fast Video Clip Search Algorithm for Copy Detection using Histogram-based Features

In this paper, we present an improved fast and robust search algorithm for copy detection using histogram-based features for short MPEG video clips from large video database. There are two types of histogram features used to generate more robust features. The first one is based on the adjacent pixel intensity difference quantization (APIDQ) algorithm, which had been reliably applied to human fa...

متن کامل

Fast and Robust Short Video Clip Search for Copy Detection

Query by video clip (QVC) has attracted wide research interests in multimedia information retrieval. In general, QVC may include feature extraction, similarity measure, database organization, and search or query scheme. Towards an effective and efficient solution, diverse applications have different considerations and challenges on the abovementioned phases. In this paper, we firstly attempt to...

متن کامل

Content-Based Video Copy Detection: PRISMA at TRECVID 2010

We present PRISMA’s Video Copy Detection system (P-VCD). The system is based on visual-only global descriptors, weighted combinations of distances, a pivotbased index structure, and a novel approximated search and voting algorithm for copy localization. We submitted four Runs to TRECVID 2010 CCD task: PRISMA.m.balanced.ehdNgryhst: a combination of edge histogram and gray histogram. PRISMA.m.bal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011